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DETAILED ACTION 

Continued Examination Under 37 CFR 1.114 

1 . A request for continued examination under 37 CFR 1.114, including the fee set 
forth in 37 CFR 1 .17(e), was filed in this application after final rejection. Since this 
application is eligible for continued examination under 37 CFR 1.114, and the fee set 
forth in 37 CFR 1 .17(e) has been timely paid, the finality of the previous Office action 
has been withdrawn pursuant to 37 CFR 1.114. Applicant's submission filed on 
01/25/08 has been entered. 

Response to Arguments 

2. Applicant's arguments with respect to claims 6-13, and 31 - 42 have been 
considered but are moot in view of the new ground(s) of rejection. 

Claim Rejections - 35 USC § 103 

3. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

Claims 6 - 13 are rejected under 35 U.S.C. 103(a) as being unpatentable over 

Pitman et al., (US PAP 2002/0143530) in view Ellis et al., (US Patent 5,504,518), and 

further in viewjiang et al., (US Patent 6,901,362). 
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As per claim 6, Pitman et al., a feature quantity extracting apparatus comprising: 
a frequency transforming section for performing a frequency transform on a signal 
portion corresponding to a prescribed time length, which is contained in an inputted 
audio signal, to derive a frequency spectrum from the signal portion ("the audio signal is 
sampled and a frequency transform is performed on a succession of set of samples"; 
Abstract, lines 1 -3; paragraph 30, lines 5-7); 

a band extracting section for extracting a plurality of frequency bands from the 
frequency spectrum derived by the frequency transforming section and for outputting 
band spectra which are respective frequency spectra of the extracted frequency bands 
("frequency bands"; Abstract, lines 5, and 6; paragraph 32, line 11); and 

a feature quantity calculating section for calculating respective prescribed 
feature quantities of the band spectra, the feature quantity calculating section obtaining 
the calculated prescribed feature quantities as feature quantities of the audio signal 
("extract features from unknown audio content"; paragraph 33; paragraph 54). 

However, Pitman et al., do not specifically teach that the feature quantity 
calculating section calculates peak values corresponding to values at respective peaks 
of the band spectra, and obtains, as the prescribed feature quantities, values of 
difference between peak values of frequency bands, each of the peak values being of a 
greatest spectrum strength among local maximums of each of the band spectra. 

Ellis et al., teach that segment recognition sub-system may detect multiple 
matches on a given key signature for consecutive frames; examines the run structure in 
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the segment signature and generates an anticipated peak value width therefrom (coL45, 
lines 25-31). 

Therefore it would have been obvious to one of ordinary skill in the art at the time 
the invention was made to generate a peak value among consecutive frames as taught 
by Ellis et al., in Pitman et al., because that would help better identify the audio content. 

Jiang et al., teach that the maximum local peak of the correlation function for 
each band is then located in a conventional manner (col.11, lines 10-12). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to locate the larger of the maximum local peak as taught 
by Jiang et al., in Pitman et al., in view of Ellis et al., because that would provide 
improved segmentation and classification of audio signals (col.1, lines 41 -43). 

As per claim 7, Ellis et al., further disclose the feature quantity calculating section 
uses binary values to represent the values of difference between peak values of 
frequency bands, the binary values indicating a sign of a corresponding one of the 
values of difference ("binary value"; col. 15, lines 37-40). 

As per claim 8, Pitman et al., a feature quantity extracting apparatus comprising: 
a frequency transforming section for performing a frequency transform on a signal 
portion corresponding to a prescribed time length, which is contained in an inputted 
audio signal, to derive a frequency spectrum from the signal portion ("the audio signal is 
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sampled and a frequency transform is performed on a succession of set of samples"; 
Abstract, lines 1 -3; paragraph 30, lines 5 - 7); 

a band extracting section for extracting a plurality of frequency bands from the 
frequency spectrum derived by the frequency transforming section and for outputting 
band spectra which are respective frequency spectra of the extracted frequency bands 
("frequency bands"; Abstract, lines 5, and 6; paragraph 32, line 11); and 

a feature quantity calculating section for calculating respective prescribed 
feature quantities of the band spectra, the feature quantity calculating section obtaining 
the calculated prescribed feature quantities as feature quantities of the audio signal 
("extract features from unknown audio content"; paragraph 33; paragraph 54). 

However, Pitman et al., do not specifically teach that the feature quantity 
calculating section calculates peak frequencies corresponding to frequencies at 
respective peaks of the band spectra, and obtains, as the prescribed feature quantities, 
numerical values related to the calculated peak frequencies; each of the peak values 
being of a greatest spectrum strength among local maximums of each of the band 
spectra. 

Ellis et al., teach that segment recognition sub-system may detect multiple 
matches on a given key signature for consecutive frames; examines the run structure in 
the segment signature and generates an anticipated peak value width therefrom (col.45, 
lines 25 -31). 



Application/Control Number: 10/667,465 Page 6 

Art Unit: 2626 

Therefore it would have been obvious to one of ordinary skill in the art at the time 
the invention was made to generate a peak value among consecutive frames as taught 
by Ellis et al., in Pitman et al., because that would help better identify the audio content. 

Jiang et al., teach that the maximum local peak of the correlation function for 
each band is then located in a conventional manner (col.11, lines 10 - 12). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to locate the larger of the maximum local peak as taught 
by Jiang et al., in Pitman et al., in view of Ellis et al M because that would provide 
improved segmentation and classification of audio signals (col.1, lines 41 -43). 

As per claim 9, Ellis et al., further disclose that the feature quantity calculating 
section calculates, as the prescribed feature quantities, values of difference between 
peak frequencies of frequency bands ("detect multiple matches on a given key signature 
for consecutive frames, and generate an anticipated peak value"; col.45, lines 25 - 31). 

As per claim 10, Ellis et al., further disclose that the feature quantity calculating 
section represents the prescribed feature quantities using binary values indicating 
whether a corresponding one of the values of difference between peak frequencies of 
frequency bands is greater than a prescribed value ("one binary value for positive 
elements"; col. 1 5, lines 37 - 40). 
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As per claim 1 1 , Pitman et al., a feature quantity extracting apparatus 
comprising: a frequency transforming section for performing a frequency transform on a 
signal portion corresponding. to a prescribed time length, which is contained in an 
inputted audio signal, to derive a frequency spectrum from the signal portion ("the audio 
signal is sampled and a frequency transform is performed on a succession of set of 
samples"; Abstract, lines 1 -3; paragraph 30, lines 5 - 7); 

a band extracting section for extracting a plurality of frequency bands from the 
frequency spectrum derived by the frequency transforming section and for outputting 
band spectra which are respective frequency spectra of the extracted frequency bands 
("frequency bands"; Abstract, lines 5, and 6; paragraph 32, line 11); and 

a feature quantity calculating section for calculating respective prescribed 
feature quantities of the band spectra, the feature quantity calculating section obtaining 
the calculated prescribed feature quantities as feature quantities of the audio signal 
("extract features from unknown audio content"; paragraph 33; paragraph 54). 

the frequency transforming section extracts from the audio signal the signal 
portion corresponding to a prescribed time length at prescribed time intervals ("the 
audio signal is sampled and a frequency transform is performed on a succession of set 
of samples"; Abstract, lines 1 -3; paragraph 30, lines 5 - 7). 

However, Pitman et al., do not specifically teach a peak frequency calculating 
section for calculating peak frequencies corresponding to frequencies at respective 
peaks of the band spectra; and a peak frequency time variation calculating section for 
calculating, as the prescribed feature quantities, numerical values related to respective 
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time variation quantities of the peak frequencies calculated by the peak frequency 
calculating section; each of the peak values being of a greatest spectrum strength 
among local maximums of each of the band spectra. 

Ellis et al., teach that segment recognition sub-system may detect multiple 
matches on a given key signature for consecutive frames; examines the run structure in 
the segment signature and generates an anticipated peak value width therefrom (col.45, 
lines 25-31). 

Therefore it would have been obvious to one of ordinary skill in the art at the time 
the invention was made to generate a peak value among consecutive frames as taught 
by Ellis et al., in Pitman et al., because that would help better identify the audio content. 

Jiang et al., teach that the maximum local peak of the correlation function for 
each band is then located in a conventional manner (col.11, lines 10-12). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to locate the larger of the maximum local peak as taught 
by Jiang et al., in Pitman et al., in view of Ellis et al., because that would provide 
improved segmentation and classification of audio signals (col.1, lines 41 -43). 

As per claim 12, Ellis et al., further disclose that the peak frequency time 
variation calculating section obtains, as the prescribed feature quantities, binary values 
indicating a sign of a corresponding one of the time variation quantities of the peak 
frequencies ("binary value"; col. 15, lines 37 - 40). 
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As per claim 13, Ellis et al., further disclose that the peak frequency time 
variation calculating section obtains, as the prescribed feature quantities, binary values 
indicating whether a corresponding one of the time variation quantities of the peak 
frequencies is greater than a prescribed value ("one binary value for positive elements"; 
col. 15, lines 37-40). 

4. Claims 31 - 42 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Pitman et al., (US PAP 2002/0143530) in view Ellis et al., (US Patent 5,504,518) 

As per claims 31 - 33, 35 -37, and 39 -41 , Pitman et al., teach a recording 
medium, and reproduction medium; and a feature quantity storage section which stores 
at least a set of a feature quantity of an audio signal and control instruction information 
associated therewith, (paragraph 54; paragraph 25). 

However Pitman et al., do not specifically teach receiving television program data 
containing an audio signal and a video signal, and is capable of recording the television 
program data to a recording medium, wherein the feature quantity extracting apparatus 
obtains a feature quantity of the audio signal contained in the television program data, 
wherein the program recording apparatus further comprises: a recording control section 
for controlling recording of the television program data to the recording medium; the 
audio signal containing music played in a television program to be recorded, the control 
instruction information instructing the recording control section to perform or stop 
recording of the television program; a feature quantity comparison section for 
determining whether the audio signal contained in the television program data matches 
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with the audio signal containing the music played in the television program based on 
both the feature quantity obtained by the feature quantity extracting apparatus and the 
feature quantity stored in the feature quantity storage section, and wherein when the 
feature quantity comparison section determines that the audio signal contained in the 
television program data matches with the audio signal containing the music played in 
the television program, the recording control section performs the control of performing 
or stopping recording of the television program data to the recording medium in 
accordance with an instruction indicated by control instruction information which is 
stored in the feature quantity storage section and associated with a feature quantity of 
the audio signal having been determined as matching with the audio signal containing 
the music played in the television program. 

Ellis et al., teach receiving television broadcast signals over a respective channel 
and demodulates the received signals to provide baseband video and audio signals. 
The video and audio signals are thereafter supplied to the segment recognition 
subsystem wherein frames signatures for each of the video and audio signals are 
generated which are thereafter compared to store key signatures to determine if a 
match exists (col. 9, lines 55 - 62). The FIR module serve to improve signature stability 
by averaging the audio spectral data over a number of television frames, thus to 
enhance the likelihood of obtaining correct signatures matches (col. 21, lines 64 - 67). 

Therefore it would have been obvious to one of ordinary skill in the art at the time 
the invention was made to match audio in a television broadcast as taught by Ellis et al., 
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in Pitman et al., because that would help verify whether video and audio signals are 
synchronized during television broadcasting. 

As per claim 34, 38, and 42, Ellis et al., further disclose that the program 
reproduction control apparatus further comprises an editing section capable of editing 
the television program data recorded in the recording medium (updating a broadcast 
segment recognition database storing signatures"; col. 5, lines 2, and 3). 

Conclusion 

5. The prior art made of record and not relied upon is considered pertinent to 
applicant's disclosure. See PTO -892. 

6. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to LEONARD SAINT CYR whose telephone number is 
(571) 272-4247. The examiner can normally be reached on Mon- Friday. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Richemond Dorvil can be reached on (571) 272-7602. The fax phone 
number for the organization where this application or proceeding is assigned is (571)- 
273-8300. 
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Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 
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